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HIGHER-ORDER METRIC SUBREGULARITY AND ITS APPLICATION^] 

BORIS S. MORDUKHOVIChd and WEI OUYANgH 


Abstract. This paper is devoted to the study of metric subregularity and strong subregularity of any 
positive order q for set-valued mappings in finite and infinite dimensions. While these notions have been 
studied and applied earlier for q = 1 and—to a much lesser extent—for q £ (0,1), no results are available 
for the case q > 1. We derive characterizations of these notions for subgradient mappings, develop 
their sensitivity analysis under small perturbations, and provide applications to the convergence rate of 
Newton-type methods for solving generalized equations. 
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1 Introduction 

This paper mainly concerns the study and some applications of the notions of higher-order metric 
su&regularity and its strong subregularity counterpart. For definiteness, we use the number q > 0 
to indicate the order/rate of the corresponding regularity under consideration. Recall first that a 
set-valued mapping F : X between Banach spaces is metrically q-regular at (better around ) 
(x, y ) € gphF if there exist a number ry > 0 and neighborhoods U of x and V of y such that 

d^x;F~ 1 (y))<'i]d q [y 1 F(x)) for all i6f/ and y G V, (1.1) 

where is the distance function associated with Q. It has been well recognized in nonlin¬ 

ear and variational analysis that metric regularity (q = 1) and the equivalent notions of linear 
openness and Lipschitzian stability play an important role in optimization, control, equilibria, 
and various applications as documented, e.g., in the books [51 HOI [221 126] with many refer¬ 
ences therein. On the other hand, metric regularity often fails for broad classes of parametric 
variational systems given by the generalized equations in the sense of Robinson [25]: 

0 € f(x,y) + Q(y), (1.2) 

where / is single-valued while Q(y ) = dip(y) is a set-valued mapping of the subdifferential/normal 
cone type generated by nonsmooth functions; see |23j and also m ia SHE] [29] for more details and 
further results in this direction. However, this phenomenon does not appear if metric regularity 
is replaced by a weaker property of metric subregularity of F at (x, y) defined by 

d(x] F _1 (i/)) < ry d{y-, F(x)) for all x € U. (1.3) 

Considering ||x — x|| instead of d(x\ F~ 1 (y)) in (jl.3|) . we get the notion of strong metric subregu¬ 
larity. In the aforementioned books and in an increasing number of papers, the reader can find 
more information about these subregularity properties, their calmness (resp. isolated calmness ) 
equivalents for inverse mappings, as well as their various applications to optimization. 

In [6l [OS SB, 30], the authors studied the notion of Holder metric regularity, which corre¬ 
sponds to m with the replacement of d(y,F(x)) by d q (y, F(x)) as 0 < q < 1. Replacing 
d(y; F(x)) by d q {y\ F(x)) in (11.31) as 0 < q < 1 gives us the notion of Holder metric subregularity 
considered recently in eh nans] from different viewpoints while without its strong counterpart. 
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department of Mathematics, Wayne State University, Detroit, MI 48202 (boris@math.wayne.edu). 
department of Mathematics, Wayne State University, Detroit, MI 48202 (wei@wayne.edu). 
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It is essential to mention that there is no sense to study metric (/-regularity of single-valued 
or set-valued mappings for q > 1, since only constant mappings satisfy this property. However, 
it is not the case for (/-subregularity that is equally important whenever q > 0 as demonstrated 
in this paper, where—to the best of our knowledge—the notion of (/-subregularity for q > 1 is 
studied and applied for the first time in the literature. 

In what follows we investigate both notions of metric (/-subregularity and strong metric q- 
subregularity for any positive q concentrating mainly on the higher-order case of q > 1. In 
this way we derive verifiable sufficient conditions and necessary conditions for these notions 
of (/-subregularity in terms of appropriate generalized differential constructions of variational 
analysis, study their behavior with respect to perturbations, and obtain their applications to 
the rate of convergence of Newton’s and quasi-Newton methods for solving generalized equations. 

Accordingly, we organize the rest of the paper. Section 2 contains some preliminaries from 
variational analysis and generalized differentiation widely used in the formulations and proofs 
of the main results given below. Section 3 is devoted to a detailed study of (/-subregularity of 
set-valued valued mappings between general Banach and Asplund spaces concentrating mainly 
on subdifferential mappings. In addition to deriving verifiable conditions that imply and are 
implied by these notions, we compare them (when appropriate) with the corresponding notions 
of metric regularity and provide several numerical examples illustrating the new phenomena. 

Section 4 studies behavior of strong metric (/-subregularity as q > 1 under parameter per¬ 
turbations. The obtained results, being of their own interest, allow us to establish in Section 5 
the convergence rate for Newton’s and quasi-Newton methods of solving generalized equations 
depending on the order of the strong metric subregularity for the underlying set-valued map¬ 
ping in the generalized equation under consideration. Section 6 presents concluding remarks 
and some directions of our future research. 

Throughout the paper we use standard notation of variational analysis and generalized dif¬ 
ferentiation. Recall that, given a set-valued mapping F: X =£ X* from the Banach space X 
into its topological dual X* endowed with the weak* topology w* , the symbol 

LimsupF(x) := { x * € X*\ 3seqs. x k —> x. x* k ^ x* with x* k € F(x k ), k € IV j (1-4) 

x^rx ' 


signifies the sequential Painleve-Kuratowski outer limit of F as x —>• x, where IV := {1,2,...}. 
Given a set 0 C M n and an extended-real-valued function ip: M n —>• M := (— 00 , 00 ] finite at x, 

the symbols x % x and stand for x —» x with x € 0 and for x —» x with ip(x) — > tp(x), 

respectively. As usual, lB(x, r) = lB r (x) denotes the closed ball of the space in question centered 
at x with radius r > 0, while the symbols IB and IB* signify the corresponding closed unit ball in 
the primal and dual spaces, respectively. Finally, given a mapping g: X —>• Y between Banach 
spaces that is locally Lipschitzian around x, we denote 


lip g(x) 


lim sup 

x,u—>x 


\\g{x) - g(u)\\ 
llx — u II 


2 Generalized Differentiation 

In this section we present for the reader’s convenience some basic tools of generalized differen¬ 
tiation widely employed in what follows. We refer to the books [5U22U2BU27] for more details 
in both finite and infinite dimensions. Since the subdifferential and normal cone constructions 
are used below only in Asplund spaces, we confine ourselves to their definitions on this setting. 
Recall that a Banach space is Asplund if each of its separable subspace has a separable dual. 
This class of spaces is rather large including, in particular, every reflexive Banach space. 
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Given p\ X —>• K with x E domy?, the regular subdifferential (known also as the presubdif¬ 
ferential and as the Frechet or viscosity subdifferential) of p at x is defined by 


dp(x) := ju E 


limj^ yW-y^-^x-x) ao | 

x->x \\x — x\\ J 


( 2 . 1 ) 


It reduces to {Vy>(x)} if p is Frechet differentiable at x and to the subdifferential of convex 
analysis if p is convex, while the set dp{x) may often be empty for nonconvex and nonsmooth 
functions as, e.g., for p(x) = — |rc| at x = 0 E R. A serious disadvantage of (|2.1I) is the failure of 
standard calculus rules required in variational analysis and its applications to optimization. 

We come to the different picture while performing a limiting procedure/robust regularization 
over the mapping x eA dp(x) as i 4 r vis the sequential outer limit (11.41) . which gives us the 
(basic first-order) subdifferential of p at x defined by 

dp(x) := Lim sup dp{x) (2.2) 

V 5 - 
x—fx 


and known also as the general, or limiting, or Mordukhovich subdifferential. In contrast to (12.11) . 
the set (12.21) is often nonconvex (e.g., dp(0) = {—1,1} for p(x) = —|x|) enjoying nevertheless 
comprehensive calculus based on variational/extremal principles of variational analysis. 


3 Necessary and Sufficient Conditions for g-Subregularity 

Let us start this section with the basic definition of positive-order metric subregularity for 
arbitrary set-valued mappings between Banach spaces. 

Definition 3.1 (metric q-subregularity and strong q-subregularity). Let F : X Y 

with (x,y) E gphF, and let q > 0. We say that: 

(i) F is metrically q-SUBREGULAR at (x, y) if there are constants r/, 7 > 0 such that 

d(x\ F - 1 (y)) < r]d q (y-, F(x)) for all x G ®(s; 7 ) (3.1) 

The infimum over all constants/moduli rj > 0 for which (13.11) holds with some 7 > 0 is called 
the exact q-subregularity BOUND of F at (x,y) and is denoted by subreg q F(x,y). 

(ii) F is strongly metrically g- subregular at (, x , y) if there are r), 7 > 0 such that 

\\x — x\\ < r]d q (y;F(x)) for all x E JB(x\ 7 ). (3.2) 

The infimum over all rj > 0 for which (13.21) holds with some 7 > 0 is called the exact strong 
Q-subregularity bound of F at (x,y) and is denoted by ssubreg q F(x,y). 

For brevity, in what follows we omit the adjective “metric” for q-subregularity. It is easy to 
see from the definitions that the strong q-subregularity of F at (x, y) implies the corresponding 
q-subregularity of F. Furthermore, the validity of q-subregularity (resp. strong q-subregularity) 
of F ar (x, y) for the fixed number q > 0 ensures this property for any 0 < q < q. 

Clearly, the larger q in the above subregularity properties the better the corresponding 
estimate (error bound) in (13.11) and (13.21) is. The following simple one-dimensional example 
shows that it makes sense to consider the q-subregularity property of order q > 1, in contrast to 
its metric regularity counterpart of such (higher) orders, even in the case of real functions. 

Example 3.2 (q-subregularity of higher order). Consider the continuous function f{x) := 
|x|5, x E M, which is not Lipschitz continuous around x = 0. We have 

\x\ < \x?\ q for any q E (0,2] and all x E 1B( 0, 1). 

This shows that / is strongly q-subregular at (0,0) whenever q E (0,2], 
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The next example is more involved, being still one-dimensional, and reveals an interesting 
phenomenon: a set-valued mapping may not be metrically regular around the given point while 
it is metrically subregular at this point with some q > 1. This example concerns in fact solution 
maps of the parametric generalized equations of type <tm which fails to have the metric 
regularity property in common situations; see Section 1. 

Example 3.3 (g-subregular but not metrically regular solution maps to parametric 
generalized equations). Consider the solution map 


S(x ) = {y G R n \ 0 <E f(x,y) + Q(y)} 

of the parametric generalized equation m with f(x,y) := x and Q: R R given by 

[2 fc +l ’ 2^] f 0T y — ( <y2) k ’ 

f° r y G ((^2)fc+i ’ ) ’ 

Q(y) := 0 for y = 0, 

for y = ~j^k, 

2 k + 1 f 0r y ^ ( ( f/2) k ’ (v / 2) fc + 1 , 


(3.3) 


as depicted in Figure 1. Then S is not metrically regular around (0,0) while it is strongly 
q-subregular of any order q G (0, 2] at this point. 



Figure 1: Q(y) 


Indeed, the failure of metric regularity of S in (13.31) around (0, 0) follows from more general 
results of |23| . Let us verify this directly for the mapping S under consideration. Due to the form 
of S in (|3.3I) and the well-known equivalence between metric regularity of the given mapping 
and the Lipschitz-like/Aubin property of its inverse (see, e.g., |22[ Theorem 1.49]), it suffices to 
show that Q in (13.3(1 is not Lipschitz-like around (0, 0). By [22L Theorem 1.41] this is equivalent 
to the fact that the scalar function 


p(x,y) := dist(x; Q(y)) = inf {||x - -y|| | v G Q(y)} 


is not locally Lipschitzian around (0,0). To check the latter, we construct two sequences 
{(xik,yik)} and {(x 2 k,U 2 k)}, which converge to (0,0) when k —>• oo as follows: 


1 1 

x iky ik ~ afc ’ 


where 0 < < min 


/- 

\k2 k, (^ 2 ) k 




1 

%2k • yk—l 5 


1 

(v^2) fc_1 ’ 


k € 1ST. 
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Then we have the equalities 


dist(x 2 fc; Q{y 2 k)) = dist [x 2 k] 


1 1 


2fc ’ i 


= dist 

1 1 


1 


1 1 


dist(x lk -Q{y lk )) =dist(^ZT;-t) = 
where the last one holds due to the estimates 


2&-1’[2 fc ’2 fc_1 
1 


= 0, 


< yik < . o ^ , k £ IN. 


(V / 2) fe ^ (^2) fc - i: 

Since ||yi fc - 2/2fc|| = < (&2 fc ) -1 , we have 


||d(a;ifc;Q(2/ifc)) - d(x 2 k]Q{y 2 kj) II = ^ > = Aj|(xi fe - x 2 k,yik ~ 2/2fc)||, 


which indicates that p(x, y) is not locally Lipschitzian around (0,0) and yields therefore that 
the solution map S from (|3.3D is not metrically regular around (0,0). 

Now we show that S is y-subregular for q = 2 and thus for any q £ (0, 2] at (0,0). To proceed, 
take r/ = 7 = 1 and, given x £ B{ 0,7), find ko so that |x| € [y^j+r, Tjry] and the corresponding 


value | Q x (x)| belongs to 


TW*’ TW k 


Notice that S 1 (0) = {0}, we have 


d(x-S ^O)) = \x\ < rjd 2 (0;S(x)) = r/(inf {||y||| y £Q 1 (x)}) 2 , x £ 0,7), 

due to j lt for any k > 2, which verifies the 2-subregularity of 5 at (0, 0) and thus 

completes our justification in this example. 

It is worth mentioning that the solution map (13.311 in Example [3T3] happens to be even strongly 
2-subregular at (0,0). It follows from the arguments above since S^ 1 (0) = {0} is a singleton. 

Next we derive characterizations of y-subregularity of any rate q > 0 for the subdifferen¬ 
tial mappings (|2.2|) generated by extended-real-valued lower semicontinuous (l.s.c.) functions 
on Banach (sufficient conditions) and Asplund (necessary conditions) spaces. For the case of 
subregularity (q = 1) the obtained characterization reduces to [Til Theorem 3.1]. For convex 
functions on Banach spaces this case while concerning only local minimizers x of / has been 
independently characterized in [I, Theorem 2.1] with a weaker modulus estimate; see more 
discussions in m presented around Corollary 3.2 and also in [TJ Remark 2.2] for convex and 
nonconvex functions with q = 1. In the general case of g-subregularity the formulation and 
proof of the theorem below are essentially more involved following the lines of the approach in 
|M , Theorem 3.2] (for strong metric regularity) and of [111 Theorem 3.1] (for subregularity). 


Theorem 3.4 (characterization of g-subregularity of the basic subdifferential). Let 

f: X —>• R be l.s.c. around x £ dom/ on a Banach space X, let x* £ df(x), and let q be an 
arbitrary positive number. Consider the following two statements: 

(i) df is q-subregular at (x, x*) with modulus k and there exist numbers 7 > 0 and r £ (0, q/n) 
with k := qKi such that 


/(*) > /(*) + (**,*- x) - -j^—d i (x; (df) X (x*)) for all x £ lB(x, 7). (3.4) 


+ q 


(ii) There are two positive numbers a and r/ such that 


f{x) > /(x) + (x*, x — x) + Y^—d s (x; (df) : (x*)) for all x £ lB(x, rj). 


+ q 


(3.5) 
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Then we have (ii)=>(i) provided that there is (3 E (0, a) with 


f(u)> f{x) + (x*,u-x)-—j—d 1 (x,(df) 1 (x*)) (3.6) 

1 + q 

1 

whenever (it, x*), (x, x*) E (gph<9/) n JB[(x,x*),ri + Conversely, we have (i)=>(ii) 

for any fixed a E (0 ,q/n) provided that the space X is Asplund. 

Proof. Let us first justify implication (ii)^=>(i) in the case of the general Banach space X 
assuming condition (13.61) with some /3 € (0, a). Since (13.51) clearly yields (13.41) . we will arrive at 
(i) by showing that there exists a number k > 0 such that 

d(x; (d/) _1 (x*)) < «d 9 (x*; d/(x)) for all x € 1B[x, qq/1 + q). (3.7) 

To proceed, fix x € JB(x,r]q/l + q). Pick any u E (<9/)~ X (x*) with ||x — u\\ < q~ 1 \\x — x|| and get 

||it — x|| < || u — x|| + ||x — x|| < r]. 

Then it follows from (13.61) the estimates 

fix) > f(u) + (x*,x-u)~ Y^d~X (u- ( df)~ 1 (x*)) > /(it) + (x*,x-u), (3.8) 

1 

which ensure in turn that for any such u and x* E df(x) D JB[x*, [rjq/( 1 + q)] i ) we have 


(x* — x*,x — u) = (x*, x — u) + (x*, u — x) — (x*, x — x) 

> fix) - /(it) - -^-dX~ (x; (<9/) _1 (x*)) + /(it) - f{x) - (x*,x - x) 
1 + q 

= fix) - fix) - {x*,x - x) - (s; ( df ) _1 (x*)) 

1 + q 

where the first inequality follows from (13.61) and (13.81) while the second one from (13.51) . Thus 

ii * —*n ii ii \ qi^ fi) jhfca ^ f 1 t—*\\ 

\\x — x || • ||x — u|| > - ——a q (x; [of) ( x )), 

which gives us the estimate 

Ik* - x*\ |d(x; (<9/) _1 (x*)) > Q ^ + d±v ~ (x; (<9/) _1 (x*)) 
due to the arbitrary choice of u E (d/) _1 (x*) with ||x — it|| < |||x — x|| < Hence 


| * — * 11 \ 
\x — x > 


q{a + ^ ds (x; ( df ) 1 (x*)) for all x* E df{x) D JB (x*, [r/q/ (1 + g)] ^). (3.9) 


If d(x*;5/(x)) < {r/q/1 + q) q , we deduce from (|3.9I) that 

rf(x*;<9/(x)) > ^ d? (x; (t?/)~ 1 (x*)) for all x E JB[x, r]q/{l + q )), 
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which justifies ([3.71) with k := [(1 + q)/(q(a — (3)] g . In the remaining case of d(x*;df(x)) > 

i 

(rjq/1 + q)i we obviously have the estimates 

d(x- (<9/) _1 (.t*)) < ||x - x\\ < < d q (x*\df(x)), 

which also justify (I3.7j) is and thus completes the proof of implication (ii)=^(i). 

Next we verify the converse (i)^=>(ii) assuming that X is Asplund. Arguing by contradic¬ 
tion, suppose that (i) holds while property (13.51) is not satisfied whenever a, rj > 0. Choose 

1 + 9 

2~ 

0 < -- < a < oo 

i + q 


and pick 6 from the interval ( q fi+ q ) > 5 )> which ensures that 6 + gq a q^i +q ^q < 1- Now we claim 
that there exists a positive number v satisfying 

f(x)>f(x) + (x*,x-x) + -— \^ d~ (x; (df)~ 1 ( x*)) for all x G lB(x,u). (3.10) 

a(l + q)K v ' 

Indeed, otherwise there is a sequence x k —> x such that 

f(xk) < f(x) + {x*,x k - x) + -— (a q + ^ hr d^r [x k - (df)~ l (x*)), k E M. 

a(l + q)K 


This together with (|3.4I) implies that all x k lie outside of ( df ) 1 (a:*). Consequently we have 

qr i±2 


inf 

x£lB(x, 7 ) 


{/(*) - (x*,x - x) + Y^ d q (®; (df) 1 ( x *))} > /(*) 


> f(x k ) - (x*,x k -x) + Y^d « 9 (x k ;(df) x (x*)) - 


q — nr 1+9 


a(l + g)«; 


d « (x k ;(df) 1 (x*)). 


(3.11) 


1+9 


Denote e k := ( a ^ +q) K ) d q ( x k\ (df ) 1 (x*)) j, 0 as k —>• 00 and dehne the function 


5 (») :=/(®)-<®*,a:-®) + 7 ^— d +q9 (x](df) V*)), x € X. 

1 + q 

It follows from (13.111) that g(x k ) < inf xeiBtx,^) d( x )+£k- Applying Ekeland’s variational principle 
(see, e.g., m Theorem 2.26]) to the function g + with X k := 0 d(x k ; (df) 1 (x*)) ensures 

the existence of a new sequence x k satisfying \\x k — ccfc || < X k and such that for each k E IV we 
have x k E intlB(x, 7 ) (due to x k — >• x and X k j. 0 as A: —> 00 ) and that 


g( x k) < g(x) + ^~\\x-x k \ 


for all x € lB(x, 7). 


Employing the Fermat stationary rule in the above optimization problem and then using the 
subdifferential sum rule held in Asplund spaces by [22] Theorem 3.41], we arrive at 


0 € d^g(-) + -^-H ■ -x||)(s fc ) 

C -x* + df(x k ) + y~JB* + dd~X (•; ( 0 /) _1 (x*))(x fc ) 

x k 1 + q 

C -x* + df(x k ) + ( rd 9 (x k ] (< 9 /) _ 1 (x*)) + 
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Combining this with the metric q-subregularity property of df at (x,x*) ensures the estimates 

d« (xk^df)- 1 ^*)) < -d(x*;df(x k )) < - (rd* (x k ; {df)~ 1 {x*)) + 

q q \ Afc/ 

for all k E IN sufficiently large. Hence for such numbers k we get the inequality 

(l --)d^(x k ;(df)-\x*))<-^. 

\ Q / Q Ah 

This allows us to successively deduce that 

(l - d(x k ; (d/) -1 ^’)) < (l - “ yy^kridf) -1 ^)) 

nr 
Q 


\q\ k ' 

< ( 1 _ -^) q d(x k -,{dfy 1 {x*))[e + 

< (l - — yd(x k -,(df)^ 1 (x*)), 


a^9 c ‘{q + l) c ‘ 


q 

where the last strict inequality follows from our choices of a and 9. Thus we arrive at the obvious 
contradiction, which justifiers our claim in (13.101) . We conclude therefore that q — {aq + l)nr < 0. 
Define now the real number 


qg+l 


r\ := 


nr — 1 

€ [0 ,q/n) 


an 


and observe that inequality (13.101) can be transformed into (13.41) with replacing r by r\ and 7 
by v, respectively. Consequently there is some real number 17 such that 

f{x)>f{x) + {x*,x-x) + - — ^ ^ ^ 1 d~ (x; (^/)~ 1 (x*)) for all x € JB{x,u 1 ). 


As before, we get the inequality nr 1 > ^ 77 , or equivalently nr > _|_ ^229Lp.. Defining 


qg+l 


r 2 '■= 


nr\ — 1 

C 0. <!' k) 


an 


^ 2 3 

and proceeding in the same way as above lead us to the inequality nr > a ^ +{ + ( a qhyi + (aq+i)* ■ 
Then we get by induction the progressively stronger bounds 


nr > 


+ 


aq 


aq+ 1 {aq + l) 2 


a k 1 q k . 

+ • • ■ + 7- 1 \ 1. = Q 1 — 


{aq + l) k 


aq 


aq+1 


for all k E IN. 


Letting k —>• 00 gives us Kr > g, which is a contradiction. Therefore there exist real numbers 
a, r] > 0 such that inequality (13.51) is satisfied. 

To justify (ii), we verify now that a may be chosen arbitrarily close to q/n while being 
smaller than this number. It suffices to consider the case of a < q/n. Take a, 9 > 0 such that 


q‘2 7 i 1 
o(l +q) < 2’ 


9 E 


2? In 

a(l + g)’2J’ 


and so 9 + 


q 

9a{ 1 + q) 


Q 

< 1. 
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Given a, r] > 0 for which (|3.5p holds, let us prove the existence p, > 0 such that 


f(x) > /(x) + (x* , x - x) + i (x-(df) 1 (s*)) for all xeJB(x,p). (3.12) 

&\ J- i q)k 

We just sketch the proof of (13.121) observing that it is similar to the proof of (13.101) given above. 
Arguing by contradiction, find a sequence of Uk —» x so that 

f(uk) < f(x) + ( x*,u k - x) + g(g + ( ° (n fc ; ( df)~ l {x *)), fc E W. 

a(l + q)K 


This gives us together with (13.51) that 

inf {/(x)- (x*,x-x) - -^d^(x;(<9/) _1 (x*))j > f{x) 

x&JB(x,rj) 1 1 + q J 

> /(«fc) - (z\ufc - x) - («fc; (d/) _1 (®*)) - ~n { u k ; (d/) -1 ^*)). 

1 + q a(l + g)/« 

Denote further := d~ («&; (9/) _1 (x*)) j. 0 as fc —» oo and consider the function 

h{x) := /(x) - (x*,x- x) - -^—d^T (x; (<9/) _1 (x*)), x € X, 

1 + q 

for which we have h(u k ) < inf x eiB(x,rf) h{x)+Vk whenever k E IN. Applying Ekeland’s variational 
principle to the function h + 8]B(x,rj) with p k := 6d(uk\ (5/)“ 1 (x*)) ensures the existence of a 
new sequence {uk} satisfying || u k — u^W < Pk and such that Uk E int lB{x,rj) with 


h(uk) < h(x) 4—-llx — Ufell for all x € JB(x, 7), k € ]N. 

Pk 

By the calculus rules as above we get the inclusions 

0 E d(h(-) + j-\\ • -x||)(« fc ) C -x* + df{u k ) + (ad* (u k ; (<9/) _1 (x*)) + 

which ensure together with the g-subregularity of df at (x,x*) that 

(x fc ; (3/) _1 (x*)) < -d(x*,df(x k )) < - (ad* (u k ; (3/) _1 (x*)) + — 

q q v Pk 

for all k € JN sufficiently large. Thus for such k we arrive at the estimate 

(l _ —)d! (<9/) _1 (x*)) < — —. 

\ q y q Pk 

This allows us to successively deduce that 

/ Ka\Q . / Ka\Q. 

[l - -) d(u fc ; (9/) (x )) < ( 1 -—) I 

Ka\<i 


IB* 


Uk - Uk II + 1 - 


/ca\ 9 


1 d(u k \ (df) \x*)) 




< 


^qpkJ 

(f - (S/T 1 ^*)) (^ + ( 


< (1 - -^y d {uk;(df) 1 (**))> 


a0(<? + 1) 
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which is a contradiction justifying the existence of /j > 0 such that (13.1211 holds. 

In the last part of the proof we define the number 

(q + (a — l)afid , , x 

«l := ~ ~ - 1 - 1 € 0 ,q k) 

an x ' 

and observe that (|3.12j) yields (|3.5|1 with this number a\ and 7] = fj, from (|3.12ll . Then proceeding 
as above allows us to find hi > 0 such that 

f[x ) > f(x) + (x*,x- x) + ~ d~^ (x; (d/) -1 ^*)) for all xeB(x,n i). 

a(l + gjK 

Define further := - ' f ' AT —'- E (0 ,q/n) and deduce from the above inequality that (13.511 
holds for «2 and 77 = yUi. By induction we find sequences {a?;} and {nk\ satisfying 

f(x) > f{x) + (x*, x - x) + Y^j akd ~ for a11 x G 

with Ofc := — E (0 ,q/n) for A: € IN and ao = a. Letting finally k —>• 00 gives us 

a k ~^ q/n and thus completes proof of the theorem. A 

Similarly to [III Corollaries 3.2, 3.3, 3.5] given in the case of q = 1, we can easily deduce from 
the obtained Theorem 13.41 its consequences for local minimizers as well as the characterization of 
strong q-subregularity in the case of arbitrary q > 0. After completing and submitting this paper, 
we were informed by Xi Yin Zheng that implication (i)=^(ii) of Theorem 13.41 could be deduced 
from m Theorem 4.1 (i)] in the case when x is an isolated minimizer of /. If in addition to this 
the function / is convex, then implication (ii)=^(i) could be deduced from [30] Theorem 4.1(h)] 
with the different proofs therein. 

Now we present two examples illustrating the results of Theorem 13.41 and the assumptions 
made therein. The first example shows that the conditions of the theorem ensures the validity 
of 2 -subregularity of the subdifferential mapping while metric regularity fails. 


Example 3.5 (2-subregularity versus metric regularity of the subdifferential). Con¬ 
sider the convex and continuous function /: R —>• R + defined by 


/(*) 


—x for x < — 1 , 

1 for — 1 < x < 1 , 
x for x > 1 . 


It is not hard to calculate its subdifferential mapping as follows: 


df{x) = < 


{- 1 } 

for 

- 1 , 0 ] 

for 

{ 0 } 

for 

[ 0 , 1 ] 

for 

{ 1 } 

for 


x < — 1 , 
x = — 1 , 

— 1 < X < 1 , 
X = 1 , 

X > 1 . 


Take x = 0, x* = 0 E df( 0) and show that the mapping df is 2-subregular (and hence q- 
subregular for any 0 < q < 2) at (0, 0) while not metrically regular around this point. To verify 
2-subregularity, it suffices to check by Theorem 13.41 that both conditions (13.511 and (13.611 of this 
theorem hold with q = 2. To proceed, take a = 1 and 77 = | in Theorem I3.4f iil and observe 
that f(x) = /(0) = 1 for any x E Then for any q E (0,2] we have 

f{x) > /( 0 ) + l;d 1+ « (x; (<9/) - 1 ( 0 )) 
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due to ( df ) 1 (0) = (—1,1), and so condition (13. 5 j) holds. To verify ()3.6p . take /3 = 0.5 and get 
f(u) = f(x) = l for any (u, x*), (x, x*) € gph (df) n JB((x, x*), rj + (r?/ 2 ) «), 

which justifies (13.61) and thus shows that df is 2-subregular at (0,0) by Theorem 13.41 

However, the validity of conditions (|3.5I) and (|3.6I) do not guarantee that the mapping df 
is metrically regular around (0,0). Indeed, we have for any x, y / 0 sufficiently close to 0 that 
df(x) = {0} while the values of (df)^ 1 (y) are either {—1} or {1}. Then it is easy to see by 
considering two sequences { 27 } = {A: -1 } and {y k } = {( 2 fc) -1 } that there is no positive number 
k, which ensures the distance estimate 


d(x k ; (df) 1 (y k )) < nd(y k ] df (x k )), k € IV, 


i.e., the subdifferential mapping df fails to be metrically regular around ( 0 , 0 ). 

The next example shows that condition (13.61) is not necessary for q-subregularity of df 
whenever q > 0. In particular, this illustrates that (i) implies (ii) but does not imply (13.6j) . 

Example 3.6 (on assumptions and conclusions of Theorem 13.41) . Define /: M —» M + by 


f(x) := < 


x 2 for x > 0 , 
0 for x = 0 , 


(— x) 2 for x < 0 . 
It follows immediately that df(x ) = {f'(x)} at x 7 ^ 0 with 


/'(*) = 


7 ^ 7 = for x > 0 , 

2y/X 

1 for x < 0 


2 \J—X 


and that [—1,1] C df( 0). The latter implies that df x (0) = {0}. Considering now x = 0 and 
x* = 0 , we claim that the mapping df is y-subregular at ( 0 , 0 ) with any positive order q. which 

_ 2 q_ 

we fix in what follows. To proceed, take 7 := 2 1+ 2 and get 

|x| < (l/2 ) 2+q <(=> |x| < (l/(2\/\x\)) q for x G 1B( 6 , 7 ), 


which verifies the validity of the y-subregularity condition ()3.1D for the mapping df. 

However, condition (13.61) fails here whenever /3 ,77 > 0. To justify it, we argue by contradiction 
and suppose that ()3.6I) holds with some (3o,r)o- It is easy to see that the inclusion 

(u, 0),(x,x*) € gph (df) D IB ^(0,0), yo + (^^) 5 ) 
yields u = 0, which implies in turn that condition (13.61) reduces to 


0 > f(x) + (x*, —x) 


q/3p 

1 + q' 


\x\ 


q + l 
q 


Considering x > 0 in (13.13|) gives us the inequality 


(3.13) 



q/3o 1 , 2 +i 

1 ^,1 q 


1 + 9 


\x\ 


i.e., 


2g/3o . 
1 + q 


q+l 

<1 


1 

> X 2 , 


which is a contradiction, since the latter inequality is obviously violated for x sufficiently small. 
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4 Strong Metric q-Subregularity under Perturbations 


It has been well recognized in the literature that metric subregularity, in contrast to metric 
regularity, is not robust/stable with respect to perturbations of the initial data; see, e.g., [TO] . 
In this section we show that the situation is different for strong subregularity and higher-order 
g-subregularity (q > 1). Namely, it is proved below that such strong g-subregularity is stable 
with respect to appropriate perturbations of the initial set-valued mapping by Lipschitzian 
single-valued ones. Furthermore, we estimate the exact bound of strong q-subregularity moduli 
together with the radius of perturbations that keep strong g-subregularity of the perturbed 
mappings. These results are used in Section 5 for establishing the convergence rate for a class 
of quasi-Newton methods depending on the order q of the assumed strong (/-subregularity of the 
initial mapping. Unless otherwise stated, we have q > 1 in the rest of this section. 

Theorem 4.1 (strong g-subregularity under Lipschitzian perturbations). Let F: X =f Y 

be a set-valued mapping between Banach spaces with (x,y) € gphF, and let g : X —>• Y be a 
single-valued perturbation locally Lipschitzian around x. If there exist k, A € (0,oo) such that 

ssubreg q F(x,y) < k and lip g(x) < A < (k«) , 


then we have the modulus upper estimate 


ssubreg q (F + g)(x,y) < 


with F(x) := F(x) — g(x). 


(1 - A ki)i 

Furthermore, if ssubreg q F(x, y) > 0, we have the modulus relationship 

ssubreg q F(x, y) 


ssubreg q (F + g)(x,y) < 


(4.1) 


(1 — (ssubreg c >F(x,y)) q lip g(x) q 

1 

whenever (ssubreg F(x,y)) q lip g(x) < 1. 

Proof. Since ssubreg q F(x, y) < n and lip g(x) < A, there exists 7 € (0, 1) such that 
\\x — x\\ < nd g [y] F(x)) and \\g(x) — g(x)\\ < A||x — x||, x € JB(x, 7 ). 


(4.2) 


For such x , take z € F(x) + g{x) and find y € F(x) such that z — y = g(x) — g{x). It yields 


\x - x\\i < ki ||y - y || < ki (||y - z\\ + ||y - z\ 


11 III 

< n q \\y — *|| -\- K q A||x — x|| < K q \\y — z\\ + K q A||x — x|b, 
where the last inequality holds due to q > 1 and 7 < 1. It gives us the estimate 

||x — x|| < --—j—||y — z\\ q for all x € lB(x, 7 ), z € F(x) + g(x), 

(1 - \K~ q Y 

which implies both inequalities (|4.1jh (|4.2h and so completes the proof of the theorem. A 

Next we derive two useful consequences of Theorem 14.11 of their own interest. The first 
one concerns strictly differentiable (in particular, C' 1 -smooth) perturbations and is employed to 
establish convergence rates of the quasi-Newton methods considered in Section 5.. 
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Corollary 4.2 (strong g-subregularity under smooth perturbations). Let in the setting 
of Theorem 14.11 the perturbation g is strictly differentiable at x, and let y E F(x) + g{x). Then 
the mapping x i-4 F(x) + g(x) is strongly q-subregular at (,x, y) if and only if the mapping 
G: i 4 g(x) + Vq(x)(x — x) + F{x) is strongly q-subregular at (x,y) with the exact strong 
q-subregularity bound ssubreg 9 (F + g)(x,y). 

Proof. Pick k € (0,oo) such that ssubreg 9 (F + g)(x) < k. Define the mapping 

g(x) := Vg(x)(x - x) + g(x) - g(x) 

and observe that g(x) = 0 and lip g(x) = 0 since g is strictly differentiable at x. Thus it follows 

from Theorem 14.11 that the mapping x i-t (F + g) (x) + g(x) = F{x) + g(x) — g(x) + q(x) = G(x) 
is also strongly q-subregular at (x, y) with the exact bound not exceeding k. The proof of the 
converse implication is similar with replacing g by —g. A 

The second consequence of Theorem 14.11 provides a lower estimate of moduli of Lipschitzian 
perturbations, which fails strong g-subregularity of the original mapping. 

Corollary 4.3 (perturbation radius for failure of strong q-subregularity). In the setting 
of Theorem 14.11 we have the following estimate: 

inf {lipq(x)| F + g is not strongly metrically q-subregular at (x,y) ) (4-3) 

g: X— k 1 1 

1 

>---r- 

(ssubreg^F(x, y)) q 

Proof. We split the proof into considering the three possible cases in the theorem. 

(i) If ssubreg q F(x,y) = oo, then the right-hand side of (14.311 is zero. Observing that for 
g = 0 the mapping F is not strong q-subregular, we conclude that the infimum in ((4.3(1 the is 
also zero, and thus the inequality therein holds. 

(ii) If ssubreg 9 F(x, y) = 0, then the right-hand side of (14.31) becomes oo. For any g : X —>Y 
with lipq(x) < oo we deduce from Theorem 14.II that the mapping F + g is strongly q-subregular 
as well. Hence the infimum in (14.31) is also oo, and thus the inequality holds therein. 

(iii) Consider the major case of 0 < ssubreg q F(x, y) < oo and suppose that (14.3(1 is violated. 
Then we find a mapping g : X Y locally Lipschitzian around x such that 

lip g(x) (ssubreg q F(x,y)) q < 1 

and the mapping F+g is not strongly q-subregular at (x, y). This clearly contradicts Theorem l4.ll 
and thus completes the proof. A 

The concluding result of this section concerns strong q-subregularity of parameterized map¬ 
pings being important, in particular, in the framework of Section 5. 

Theorem 4.4 (strong q-subregularity of parameterized mappings). Let F : X Y 

be as above, and let g: X —>• Y be C 1 -smooth around x. Assume that the mapping G: x 4 
g(x) + Vq(x)(x — x) + F(x) is strongly q-subregular at (x,y) with y E G(x). Then for any 
A > ssubreg q G(x, y) there exists 7 > 0 such that the parameterized form of G defined by 

xi-4 G(u, x) := q(x) + Vq(u)(x — x) T F(x) with uGl B(x,y) 

is strongly q-subregular at (x,y) with modulus X, i.e., there is rj > 0 for which 

\\x — x\\ < Xd q (y-,G(u,x)) whenever x E IB(x,r/). 
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Proof. Take A > k > ssubreg q (G,x,y) and select /x > 0 so that 

Kj 1 

A > -j— and ///«« < 1. 

(1 — [XKQ )9 

By the assumed C 1 property of g, find 7 > 0 such that 

||V< 7 (?x) — Vp(x)|| < g for all u E lB(x, 7 ). 

Fix further u as above and define a new parameterized mapping g : X —> Y by 

g{x) := g(u) + Vg(u){x - u) - g(x) - S7g(x)(x - x). 

Then we have g(x) = g(u) + Vg(u)(x — u) — g(x), and hence 

\\g(x)-g(x')\\ = \\Vg(u)(x - x')-Vg{x)(x - x')\\ 

< \\Vg(u) — Vp(x)|| • \\x — x'|| < g\\x — x'll 

for any x,x' € lB(x, 7 ). Thus lip^(x) < ji. Applying Theorem 14.11 to the mappings G and g 
ensures that the mapping 

x i->- G(x) + g{x) = G(x) — g(x) + g(x) 

g(x) + Vg(x)(x — x) + F(x) + (\7g(u) — S7g(x))(x — x) 

= g(x) + Vg(u)(x — x) + F{x) = G(u,x ) 

is strongly g-subregular at (x, y) with th exact bound not exceeded A. This tells us that for any 
u € IB(x, 7 ) there is 7 > 0 such that 

\\x - x|| < Xd q (y-G(u,x)) for all x £ lB(x,r]), 

which thus completes the proof of the theorem. A 

5 Applications to Quasi-Newton Methods 

In this section we discuss some applications of the results on strong (/-subregularity under per¬ 
turbations obtained in Section 4 to the convergence rate for a class of quasi-Newton methods to 
solve generalized equations given in the form 

0 G g(x) + F(x), (5.1) 

where g : X —>• Y is a single-valued mapping while F: X =£ Y is a set-valued mapping between 
Banach spaces. As in j9], we consider the following class of quasi-Newton methods to solve (15.1|) : 

0 € g(x k ) + B k (x k+ i - x k ) + F(x k +i), k = 0 , 1 ,..., (5.2) 

where B^ signify a sequence of linear and bounded operators acting from X to Y. In the 
case of Bh = Xg(xk) algorithm (15.21) corresponds to Newton’s method while particular choices 
of the operator sequence {-£>&} make it possible to include in this scheme various versions of 
quasi-Newton methods; see more discussions in mm- 

Let 1 be a solution to m, and let { 07 } be a sequence generated by (15.2|) that converges 
to x. The classical Dennis-More theorem [ 8 ] Theorem 2.2] for F = 0 establishes a certain 
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characterization of the superlinear convergence (called in [8] the Q-superlinear convergence, 
where Q stands for “quotient”) of the quasi-Newton iterations 


linr ^ fc+1 = 0 (5.3) 

k—y oo *2/11 


under the smoothness of g: M n —> M n and nonsingularity of its Jacobian \7g(x). Recently [9], 
Theorem 3], Dontchev extended this result to the case of generalized equations (15.11) assuming 
that the mapping x i->- g(x) + S7g(x)(x — x) + F(x) is strongly subregular at (x,0), which reduces 
to the nonsingularity of Vg(x) in the setting of |8j. 

The following theorem imposes the q-subregularity of F at (x, —g(x)) as q > 1 and shows, 
by using the approach somewhat different from both papers mm and based on the stability 
results of Section 4, that we have the higher convergence rate 


||x fe+ i-a:| 

Inn —-—— 

k-> oo \\Xk — 


= 0 , 


q > i, 


(5.4) 


which reduces to the superlinear one in (15.31) for q = 1. 


Theorem 5.1 (convergence rate for quasi-Newton iterations). Let x be a solution of the 
generalized equation m, where g: X Y be a mapping between Banach spaces that is C 1 - 
smooth on some convex neighborhood U of x. Given a starting point xq E U and a sequence of 
linear and bounded operators Bk'- X —>• Y, consider the corresponding sequence {xk} generated 
by (15.21) such that { xk} C U and x^ —» x as k —>• oo. Assume that the set-valued mapping 
F: X Y in () 5.1 [) is strongly q-subregular at (x,—g(x)) with some q> 1 and that there exist 
positive numbers k. A for which 


ssubreg' ? F(x, — g(x)) < k and lipg(x) < A < 1 . 

Suppose also that Xk ^ x for all k G IN. Then we have the implication 


lim II (^fc - Vg(^))(^fc+1 - a?fc)|l _ ■ 

Lfe^-oo \\xk+i ~ Xk\\ 


(E2D- 


(5.5) 


(5.6) 


Proof. By (15.51) it follows from Theorem 14.11 that 


ssubreg q (F + g)(x,—g(x)) < - T — with F(x) = F(x) — g(x). 

(1 — 

Then Corollary 14.21 tells us that the mapping G: x g(x) + Vg(x)(x — x) + F(x) = S7g(x)(x — 

x) + F(x) is strongly q-subregular at (x, —g(x) with modulus g < -—. To prove implication 

(l-A K®)® 

(15.61) with q > 1, take any small e > 0 and find a natural number ko sufficiently large so that for 
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all k > fco we have the relationships 


lkfe+i - x\\i < n q d( ~ g(x)]G(x k+ 1 )) = n«d(- g(x)\ S7g(x)(x k+ i - x) + F(x k +i)) 
1 

< g q \\- g(x) + g(x k ) + B k (x k+ 1 - X k ) - S7g(x)(x k+ 1 - x)|| 

= V q \\g(x k ) - g(x) - Vg(x)(x k -x) + (B k - Vg(x))(x k+1 - x k )\\ 

/ S7g(x + t(x k - x)) (x k - x)dt- Vg(x)(x k - x) 

J o 


< 


1 

fl q 


+ g q \\B k - Vg(x)\\\\x k+1 - x k \ 


< g~‘ 


(Vg(x + t(x k - x)) - Vg{x)) ( x k - x)dt + gie\\x k+1 - x k \ 


< fiie\\x k — x\\ + gie\\x k+ i — x\\ + fiie\\x k — x\ 

i _ i_ i 

< 2fii e\\x k — x|| 4- fiie\\x k+ i — x\\i. 


This gives us the upper estimate 

\\x k +i - x\\ < 


2 Ve 9 


(1 - gieY 


■\\x k - x\\ g , 


which ensures the validity of (15.4)1 and thus completes the proof of the theorem. A 

It worth mentioning that the inverse implication also holds in (15.61) . which in fact follows 
from the proof of |2j Theorem 3] for q = 1; cf. also [8]. 

As shown by the examples of Section 3, strong higher-order subregularity (q > 1) holds in 
natural situations when metric regularity fails. The following simple example, where F(x) is a 
non-Lipschitzian function, illustrates the application of Theorem 15.11 in such settings. 


Example 5.2 (quasi-Newton method for non-Lipschitzian 2-subregular equations). 

Let g(x) := x 2 and F(x) := |x|2, x € K. Then x = 0 is a solution to the generalized equation 
m, which in this case reduces to a nonsmooth equation defined by a non-Lipschitzian function. 
As has been well recognized in the literature (see, e.g., m fT6] and the references therein), 
the vast majority of the results on Newton-type methods for nonsmooth equations concerns 
Lipschitzian ones, while non-Lipschitzian settings are highly challenging. Based on Example 13.21 
we conclude that F is strongly 2-subregular at (0,0). It is easy to check that the other conditions 
of Theorem o are also satisfied. Consider now the quasi-Newton method (I5.2p with 

B (2^r + (^)- 

*' (2 m ) _1 - (2(*+l)!)-‘ ' 

It is easy to verify that lim^oo \B k — Vg(x)\ = 0. Then for any starting point xq close to x, 
it follows that algorithm (1 5.2 jl generates a sequence {x k } = {(2 fc! ) -1 } converging to x = 0 with 
the convergence rate that exceeds q = 2. 


6 Concluding Remarks 

This paper demonstrates, for the first time in the literature, that the notions of metric q- 
subregularity and strong metric (/-subregularity for sent-valued mappings can be useful not only 
in the cases when q £ (0,1] but also for q > 1. This significantly differs metric g-subregularity 
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from metric (/-regularity, which does not make sense when q > 1. Besides various examples, 
characterizations of these notions and their sensitivity analysis, we provide applications to the 
convergence rate of quasi-Newton methods of solving generalized equations. It seems that it 
is only the beginning in the study and applications of these fruitful higher-order notions of 
variational analysis. We intend to develop more applications of higher-order metric subregularity 
and strong subregularity to various aspects of optimization; in particular, to convergence rates 
of other important algorithms in numerical optimization. 
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